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Abstract 

In this paper we obtain the expectation and variance of the number 
of Euler tours of a random Eulerian directed graph with fixed out-degree 
sequence. We use this to obtain the asymptotic distribution of the number 
of Euler tours of a random d-in/d-out graph and prove a concentration 
result. We are then able to show that a very simple approach for uniform 
sampling or approximately counting Euler tours yields algorithms running 
in expected polynomial time for almost every d-in/d-out graph. We make 
use of the BEST theorem of de Bruijn, van Aardenne-Ehrenfest, Smith 
and Tutte, which shows that the number of Euler tours of an Eulerian 
directed graph with out-degree sequence d is the product of the number 
of arborescences and the term ^[ni,gv('^" ~ ^)']- Therefore most of our 
effort is towards estimating the moments of the number of arborescences 
of a random graph with fixed out-degree sequence. 



1 Introduction 

1.1 Background 

Let G = (V, E) be a directed graph. An Euler tour of G is any ordering 
e^(i), . . . , e.Tj(]^E\) of the set of arcs E such that for every 1 < i < \E\, the target 
vertex of arc e.n-(i) is the source vertex of e.n^(i+i), and such that the target vertex 
of e7r(|_E|) is the source of e^(i). We use ET{G) to denote the set of Euler tours 
of G, where two Euler tours are considered to be equivalent if one is a cyclic 
permutation of the other. It is a well-known fact that a directed graph G has 
an Euler tour if and only if G is connected and if for each v ^ the in-degree 
and out-degree of v are equal. In this paper, we are interested in the number of 
Euler tours of a random Eulerian directed graph with fixed out-degree sequence. 
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Let d = (di, ^2, . . .) be a sequence of positive integers. We let Gn be the space 
of all Eulerian directed graphs on vertex set [n] = {1, 2, . . . , n} with out-degree 
sequence (di, ^2, . . . , d„). We use m = J2ve[n] '^^ t° denote the number of arcs 
in a graph G G Q"^. In the case where di = dj for all «,j e [n], we refer to 
the graphs as d-in/d-out graphs and denote this set by G'^ '^- In this paper, we 
obtain asymptotic estimates for the first and second moments of the number of 
Euler tours of a uniformly random G G t/^, for any fixed out-degree vector d. 

Using the estimates of the moments, we determine the asymptotic distribu- 
tion of the number of Euler tours of a random G £ G^ '^ ■ Similar results have 
previously been obtained for various structures in the case of undirected regular 
graphs. For example, the asymptotic distribution has already been characterised 
for Hamiltonian cycles [HI [T31 [S] , 1-factors [9^ , and 2-factors [TT] , in the case 
of uniformly random d-regular undirected graphs. In each of these results, one 
of the goals was to prove that the structure of interest occurs in G with high 
probability when G is chosen uniformly at random from the set of all undirected 
d-regular graphs. Since every connected d-in/d-out graph has an Euler tour, the 
existence question is not of interest here. However, in the case of Hamiltonian 
cycles the asymptotic distribution was further used by Frieze et al. j5] to prove 
that very simple algorithms for random sampling and approximate counting of 
Hamiltonian cycles run in expected polynomial time for almost every d-regular 
graph. This paper contains analogous counting and sampling results for Euler 
tours of d-in/d-out graphs for d > 2. 

Our result uses a well-known relationship between the Euler tours and ar- 
borescences of an Eulerian graph. An arborescence of a directed graph G = 
{V, E) is a rooted spanning tree of G in which all arcs are directed towards the 
root. We will use ARBS{G) to denote the set of arborescences of G and, for 
any v & use ARBS{G, v) to denote the set of arborescences rooted at v. For 
any Eulerian directed graph G, the BEST Theorem (due to de Bruijn and van 
Aardenne-Ehrenfest Tf], extending a result of Smith and Tutte ^T^) reduces 
the problem of computing \ET{G)\ to the problem of computing \ARBS{G,v)\, 
for any vertex v (Iz V . 

Theorem 1 ([TH [T7]). Let G — {V,E) be an Eulerian directed graph with 
out-degree seguence d. For any v V , we have 



\ET{G)\ 



\ARBS{G,v)\. (1) 



The above theorem enables exact counting or sampling of Euler tours of any 
Eulerian directed graph in polynomial time. For any given digraph G — {V, E), 
the well-known Matrix-tree theorem shows that for any v € V the number of 
arborescences into v E V exactly equals the value of the {v, i;)-cofactor of the 
Laplacian matrix of G (see, for example, [H]). Colbourn et al. [1] gave an algo- 
rithm allowing sampling of a random arborescence rooted at v to be carried out 
in the same time as counting all such arborescences. Hence, applying the BEST 
theorem stated above, the twin tasks of exact counting and uniform sampling 
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of Euler tours of a given Eulerian digraph on n vertices can be performed in 
the time to evaluate the determinant of an n x n matrix, which at the time 
of writing is 0{n'^) for c < 2.3727[IB]. An alternative approach to sampling is 
presented in [TU] , 

1.2 Naive algorithms 

In this paper, we take a different approach and consider a very naive algorithm 
for sampling Euler tours of an Eulerian digraph. To describe this algorithm, 
it helps to introduce the concept of a transition system of an Eulerian digraph 
G = {V,A): for every v G V, consider the set In{v) of arcs directed into v, 
and the set Out{v) of arcs directed away from v (in a multi-graph we allow the 
possibility that In{v) n Out{v) 7^ 0). We define a pairing P{v) at w to be a 
matching of In{v) with Out{v). Finally we define a transition system of G to 
be the union of a collection of pairings, one for each vertex of the graph. We 
let TS{G) denote the set of all transition systems of G. If G has the out-degree 
sequence di,...,(i„ (for n = \V\), then \TS{G)\ = nr=i '^i'- ^ote that every 
Euler tour of G induces a unique transition system on G. 

Our naive sampling algorithm presented in Figure [T] generates a random 
transition system for G and tests whether it induces an Euler tour. 

Algorithm Sample(G = {V, A)} 
for V G V do 

Choose a pairing P{v) of In{v) with Out{v), drawn uniformly at random 
from all pairings, 
end for 

if Ut,gyP(w) induces an Euler tour T on G then 

return T 
else 

return 
end if 

Figure 1: Algorithm Sample 

We make two simple observations. First, observe that Sample(G = {V,A)) 
generates all transition systems of G with equal probability. Hence all transi- 
tion systems corresponding to an Euler tour will be generated with a uniform 
probability (which is [0"=! '^*']~^)- Second, the probability that one execution 
of Sample(G = (V.A)) returns an Euler tour is exactly \ET{G)\/\TS{G)\ = 
\ETiG)\x[l]-^,d,\]-\ 

In Figure O we present our simple approximate counting algorithm. Ob- 
serve that for any given k e N, that the expectation M[k/K] of the value re- 
turned by Approximate(G = {V,A),k) is \ET{G)\/\TS{G)\. However, the 
probability that the value returned by Approximate(G = {V,A),fi) will be 
close to \ET{G)\/\TS{G)\ depends both on k and on the value of \ET{G)\. 
If we are given a graph G whereby \ET{G)\ is guaranteed to be larger than 
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Algorithm Approximate(G = {V, A),k) 
k 0; 

for 2 = 1 ^ K do 

T 4- Sample(G') 

if T 7^ then 
k -.^ k + 1; 

end if 
end for 
return k/n 

Figure 2: Algorithm Approximate 



n"=i "^j-i where p{n) is some fixed polynomial in n, then by setting k 
appropriately we can guarantee that with high probability Approximate (G = 
(V, A), k) will return a close approximation of \ET{G)\/\TS{G)\. However, there 
exist Eulerian digraphs where the number of Euler tours is only an exponentially 
small multiple of Y[i=i '^i'- 

In this paper we consider the performance of Sample and of Approxi- 
mate on random regular Eulerian digraphs of bounded degree d. Our goal 
will be to show that as the number of vertices grows, that for some k polyno- 
mial in 71, the probability that Approximate returns a close approximation 
of \ET{G)\/\TS{G)\ tends to 1. This requires that we can demonstrate two 
things: (i) that the expected number of Euler tours of a random Eulerian di- 
graph of fixed degree is polynomially-related to \TS{G)\ = (d!)"; that is, there 
is some h > such that the expected number of Euler tours is greater than 
n~^{d\)'^\ (ii) that \ET{G)\ on random d-regular Eulerian digraphs is concen- 
trated within a window of this expected value. 

Note that our algorithms for sampling and approximate counting of random 
Eulerian digraphs have previously been analysed for the case of Eulerian tour- 
naments in [8j. This was done as part of their analysis of Euler tours on the 
undirected complete graph with an odd number of vertices. It does not overlap 
our research - tournaments are regular of degree {n — l)/2. 

1.3 Our proof 

The results in this paper are of an asymptotic nature. If a„ and 6„ are se- 
quences of numbers, we take a„ — >■ 5„ to mean lim„_j.oo a„/6„ — 1. Given a 
sequence of random variables A„ and random variable Z , we say A„ converges 
in distribution to Z , or Z has the asymptotic distribution of if 

hm P[A„ < x] = ¥[Z < x] . 

We generate graphs in using a directed version of the configuration 
model [2j[3]. We define the configuration space $5? follows. For each v G [n], 
let Sv and Ty be disjoint c?^-sets and let S — U„g[„]S't, and T = U„g[„]T^. We say 
Sy is the set of configuration points available for arcs leaving v and Ty is the set 
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of points available for arcs entering v. A configuration is a perfect matching 
from S to T and $^ is the set of all configurations. Note that j^^j — ml. Each 
configuration F e $^ projects to a directed multi-graph a{F) by identifying the 
elements of Sy and Ty. That is, cr(F) has an arc (m, v) for each pair from Su x Ty 
that is contained in F. This model has been analysed in [1] Section 7], to obtain 
an estimate of the expected number of Euler tours of a random G G Gn''^ for the 
case d — 2. One nice property of the model, and of the original configuration 
model, is that directed graphs (without loops or double arcs) are generated with 
equal probability. Hence, by studying properties of uniformly random configu- 
rations it is possible to infer results about uniformly random elements of Gn j by 
conditioning on there being no loops or double arcs. 

In Section [2l we consider the configuration model for general (bounded) 
degree sequences. We first prove a useful combinatorial lemma; then in Theo- 
rem [2] we derive and prove exact expressions for the first and second moments 
for the number of arborescences of cr(F), when F is a configuration drawn uni- 
formly at random from Next, in Theorem [31 we condition on the event that 
(t{F) is a simple graph, to derive close approximations for the first and second 
moment, for the number of Arborescences, when G is a simple graph drawn uni- 
formly at random from Gn ■ As an immediate corollary we obtain corresponding 
approximations for the first and second moment when the random variable is 
the number of Euler tours. The expected value for the number of Euler tours 
over Gn is shown in Corollary [T] to tend to the value ^(nt)e[n] ^I'Oi which is a 
^ fraction of |r5'(G)|. Therefore point (i) of Subsection [fj holds. 

In the analysis of random structures, it is sometimes the case that we can 
prove concentration (of a random variable within a fixed range) by applying 
Chebyshev's inequality to the first and second moment of that random variable. 
In the final part of Section [2] we show that the values of the first and second 
moments for Euler Tours in G^ sre not good enough to prove concentration of 
measure using Chebyshev's inequality. 

It is for the above reason that in Section[3]we use a more complicated method 
to show that the number of Euler tours for G S C/^ is asymptotically almost 
surely close to its expectation. The proof idea we use to obtain an asymptotic 
distribution is that of conditioning on short cycle counts, pioneered by Robinson 
and Wormald [HJ [T3]. Implicit in this pair of papers (and the subsequent 
work of Frieze et al. [5]) is a characterisation of the asymptotic distribution 
of the number of Hamiltonian cycles in a random d-regular graph in terms of 
random variables counting the number of i-cycles, for all fixed positive integers 
i. Janson [6] streamlined the technique of Robinson and Wormald and proved a 
general theorem (stated by us as Theorem 2]). In Section SI we use Theorem 2] 
to obtain an asymptotic distribution for the number of Euler tours of a random 
d-in/d-out graph. 
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2 Expectation and Variance of Euler tours 



In this section, we obtain the expectation and variance of the number of Euler 
tours of a random d-in/d-out graph. We will use two particular facts several 
times in the proofs of this section. Recall the definition of falling factorial 
powers: for every n,k G N, 

{n)k = n{n - l){n - 2) • • • (n - fc + 1) . 

Fact 1. Falling factorial powers of sums obey the well known multinomial the- 
orem 




where the sum is taken over all partitions of k into I non-negative integer parts. 

Fact 2 (see, e.g., [11]). Let V = {l,2,...,n}, and let 5 ^ {5.o : v e V} he a 
given vector of non-negative integers. The number of k- forests on V in which v 
has dy children is 

/ n — l\ / n ~ k \ 

[k -l)\S, -.v ev) ' 

We use Fact [1] and Fact [2] to prove the following lemma. In this lemma, 
and in the proofs of subsequent results, we will speak of a configuration for an 
(in-directed) arborescence or forest. We take this to mean a partial matching 
from S to T (in the configuration model) that projects to an arborescence or a 
forest. 

Lemma 1. Suppose we have a set of vertices V — [n] for which there are Xy 
points for arcs entering v Cz V and Tjy points for arcs leaving v d V , with Xy not 
necessarily equal to yy . Then, the number of ways to choose a configuration for 
an in-directed forest rooted at R CV is 

\veV\B. J \veR / \veV / n-\R\-l 

Proof. Let be a forest on [n] rooted at R and let Sy be the number of children 
of V in J^, for each v € V. The number of ways to choose points for the source 
and target vertex of each arc in J- is 




since we must choose a point for the start of the arc directed away from each 
V ^ R and choose one of the Xy points for the end of each of the Sy arcs directed 
towards each v ^V. 
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Let k = X^-ue-R '^^^ construct a forest rooted at R by first choosing a 

fc-forest on y \ i?, and ttien attaching each root of this forest as a child of some 
V E R. The reason we take this approach is to allow us to use Fact [21 which 
is not explicitly set up to allow us to specifiy particular roots. By Fact [21 the 
number of A:-forests on y \ i? in which v G V \ R has exactly children is 



n- |i?| - 1\ / n-\R\-k 
fc - 1 )\6^:v(EV\R 



(4) 



and the number of ways to divide the roots of this forest amongst the members 
of R so that each v & R has Sy children is 

s^-.I^r)- 

Combining and (O and summing over all possible values for Sy gives 



i: (.:rJfn«) n 

^(E„ev\H'5-)="-|«|-fc veV\R 



n y- 



(6) 

By Fact [T] we see that the two sums over the different Sy in © are expansions of 
the falling factorial powers {J2yeR^v)k and {J2yev\R^v)n^\R\-k , respectively. 
Hence, ((6|) is equal to 



C^Xy)k{ ^ X„)„_.|fl,|_fc . 

veR^ vGV\R 

Applying Fact [H again gives □ 

We now use Lemma[Tlto analyse the expectation and variance of the number 
of arborescences in (j{F), when F is chosen uniformly at random from $5^. We 
say ^ C is an arborescence oi F G if a- {A) is an arborescence of a-{F). 
In the following proofs, we will abuse terminology slightly and switch between 
speaking of arborescences of configurations and directed graphs arbitrarily. We 
will define ARBS{F), for any F G Q^,tohe the set of partial matchings on S'x T 
which project to an Arborescence on [n\. 

Theorem 2. Let d — (di,(i2,...) be a sequence of positive integers. For each 
n € N, let denote the number of arborescences (rooted at any vertex) of a 




7 



uniformly random F G <I>^. Then, 



n 



mi] = - 



TO 



TO 



m — n + I 

Proof. We start by computing the first moment of A^- To calculate the first 
moment of we need to enumerate pairs {F,A), where F G $^ and A is 
an arborescence of F, and then divide this quantity by \^n\- Given A, it is 
easy to count the number of configurations F G <I>^ for which A C F. In any 
directed graph G with to arcs, there are exactly m — n + 1 arcs not contained 
in any particular element of ARBS{G). Hence, if we have a configuration for 
an arborescence, there are (to — n + 1)! ways to extend this to a complete 
configuration. Applying Lemma [T] with s = t = d, we see that the number of 
arborescences rooted at any particular vertex v is 

dv\ W du \ (jn~ l)„-2 . (7) 
\«eH\{t,} / 

By the BEST theorem (Tlieorem[T]), there are an equal number of arborescences 
rooted at each vertex of any F G Hence, multiplying ([7]) by by n(TO — n + 1)! 
gives the number of pairs {F,A) with F G and A G ARBS{F): 

n(TO-l)! W d„ 

\ve[n\ 

Finally, dividing by the total number of configurations in <J>^, which is to!, gives 
the claimed value for ]E[.4J^]. 

To compute the second moment of A*^ we need to evaluate the following 
expression 

— \ARBS{F)\\ (8) 
to! ^-^ 

We observe that the term \ARBS{F)\^ in ([8]) is equal to the number of elements 
in the set 

{{A,A^) : ^ ARBS{F)) . 



That is, 



where 



$d = {[F,A,A!) : F G ^i,A,A' G ARBS{F)\} . 
Hence, to evaluate E[(y^*)^] we need to count the number of elements of 
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We compute |$^| as follows. First, we count the number of ways to choose 
the intersection of a pair of arborescences A and A' . Then, we count the number 
of ways to extend this intersection to A and A' . Finally, we count the number 
of ways to choose the remainder of F so that A and A' are both in ARBS{F). 

We start by considering the final stage. Suppose we have a pair of arbores- 
cences {A, A') of some configuration F e <i>^ and suppose = ^ n .4' is a forest 
rooted at i? C [n]. Since we need to add — 1 arcs to T for each arborescence, 
there will be n + |i?| — 2 edges in ^ U ^' and, hence, there are (m — n — + 2)! 
ways to choose the remaining edges for F. 

Now we examine the number of different pairs {A, A') with F = ACiA' 
rooted at R. In the analysis that follows, we will overcount slightly, with the 
pair of Arborescences {A, A') depending on the roots of A and A'. We use the 
BEST Theorem (Theorem [T]) to get back to the correct number at the end of 
the proof. 

We start by counting the number of ways we can choose F, the edges in 
both arborescences, and then count the number of ways to choose the edges 
which are in one or the other arborescence. By Lemma [U the number of ways 
to choose F rooted at R is 

f n dA[J2dA (m-l)„_|;,|_i. (9) 
/ \veR J 

For each v € R, let Fy denote the component of F with root v, and let 
be the number of points in [J^^^jr not used by arcs in F. That is, 

Xy ^ ^ du- \Fy\ + 1 . 

Note that this is the number of points available to add arcs directed towards 
vertices of Fy when we are completing A and A' . Moreover, we have 

Xy — m — n + \R\ . 

We now turn our attention to the number of ways to choose A\A' and 
A' \ A. Choosing the remaining arcs for A and A' is equivalent to choosing a 
pair of disjoint configurations for trees on R in which there are Xy points available 
for the targets of arcs entering v and dy points available for the sources of the 
arcs leaving u, for each v (z R. 

Suppose we have already chosen ^ \ such that the root of ^ is r and 
suppose that there are Sy arcs from ^ \ directed towards vertices in Fy , for 
each V E R. Now, suppose we want to choose A' \A such that the root of A' is 
r' , and, for the moment, suppose r ^ r' . Choosing Al\A amounts to choosing a 
tree on R rooted at r' in which there are Xy — Sy points available for arcs directed 
towards each v, dy — 1 points available for the source of the arc directed away 
from each v r, and dr points available for the source of the arc directed away 
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from r. Hence, by Lemma [TJ we see that the number of ways to choose A'\A 
is 



Ylidv - 1) j (m-7i)|^|_2. 



{dr - l){dr> - 1) 

Using Fact [51 we can deduce that the number of ways to choose A is 



dr ^ \Sr — 1; 5t, : u e i?\{r}y 

(5r>l 

Therefore, the number of ways to complete T lo A^A! is equal to 
times 



\S\ = \R\-1 ^ ' \L J/ ,„g^ 

<5,->l 

We can divide ((TU]) into two sums: 



(10) 



and 



|«| = |H|-1 ^ ' \ I- J / ^g/J 



,?/"U-i;r:;:.\w)n(-... 



(12) 



(5r>l 



Applying Fact [U we see that (fTT|) and ([T2|) are equal to a;ra;r'(m— — l)|i(;|_2 
and —XrXrii\R\ — 2)(m — n + |i?| — 2)|fl|_3 respectively. Hence, the number of 
ways to complete T \.o A^A! is 



{dr - l)idr' - 1) 



.veR J 



m-n+|i?|-2)2|K|_4. (13) 



If r = r', we can apply an almost identical argument to show that the number 
of ways to complete to ^ U is 

(n '^^('^^ -!))(--"+ \R\ 2)2|;,|_4 . (14) 

Multiplying (IT^ and by (d^ ^ l)((ir' — 1) and dr{dr — 1) respectively, and 
summing over r and r' gives 

^ XrXr' + ^a;r(xr - 1) I J]^ rft,(dt, - 1) j (TO~n.+ |i?|-2)2|fl|_4 . (15) 

r^r' r I \v£R / 
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Since X^refl Xr — rn ~ n + \R\, we have 

^ X'pX'p^ I ^ X'p^^X'p 1^ " — ^ I 37^^ 1 I ^ ^3/^ 

= (to - n + |i?|)(TO -n+\R\-l). 

Hence, (fTS]) is equal to 

I n ^^('^^ - 1) ) (to - + |i?|)2|fi|-2 • 

Multiplying by the number of ways to choose given in and the num- 
ber of ways to choose the portion of F not contained in AUA', which is 
(to — n — |-R| + 2)!, yields the following expression 




i)!(n(^''-i)) (Em ■ (16) 

\veR J \veR J 



The expression ([T6l) over-counts the number of triples {F, A, A') in which the 
intersection AoA' is a forest rooted at R. Each triple {F,A,A') in which A 
and A' are rooted at different vertices u and v is counted ((i„ — l)(d^ — 1) times, 
and each triple {F, A, A') in which A and A' are rooted at the same vertex v is 
counted dy{dv — 1) times. 

Only the second two factors of p6p depend on R. Summing these over all 
R(-V gives 



RC[ 



E (E'^") (ll('^v-i)] , (17) 



We can evaluate (I17p by separating it into n separate sums, each corresponding 
to the sum over i? 9 for a particular v G [rt] , 

E n - 1) = - 1) I n ^« I ■ (18) 

RBvueR \ue[n] J 

Summing the right-hand side of (|18p over each v £ [n] and combining with the 
rest of gives 

]J d^, j (TO-n)(TO- 1)!. (19) 

We cannot immediately obtain the quantity we are looking for from (jl9p 
as it over-counts different triples by different amounts. However, by the BEST 
theorem (Theorem [T]) , we know that the number of triples {F,A,A') in which 
A is rooted at u and A' is rooted at v does not depend on u or d, since the 
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projection a{F) is always an Eulerian directed graph. Thus, it follows that the 
factor by which (|19[) over-counts the number of triples is 



- - + Mdv - 1) I = ^ '- . (20) 

Dividing dH]) by (HO)) and ml gives 

m(m — n + 1) 



niA:^] = — — T n 

m(m — n + 1) \ 

\ve[rL\ 



□ 



Recall that simple directed graphs are generated with equal probability in 
the configuration model. Thus, by conditioning on ct{F) containing no loops or 
2-cycles, we can obtain the first two moments of the number of arborescences 
of a uniformly random G € . 

Theorem 3. Let d be some fixed constant, let d = (di, (i2, . . .) he a sequence 
of positive integers satisfying di < d for all i, let n G N, and let m = X]"=i ^-v ■ 
Let An denote the number of arborescences of a directed graph chosen randomly 
from Q^. Then, as m — n ^ oo. 



E[X] ^ e- 
m 



Proof. In the following we will use m2 to denote d^,. 

The proof is as follows. We say F contains a loop at v if there is an edge 
from Sv X Tt, in F and that F contains a double arc from u to w if there is a 
pair of edges from 5„ x Tt, in F. Let L and D denote the number of loops and 
double arcs in a random F e Then, the event "F is simple" is equivalent to 
the event {L = D = 0}. We first analyse the distributions of L and D, which we 
can use to estimate the probability that F is simple. Then, we consider two new 
random variables, L^^'^ and D''^\ which count the number of loops and double 
arcs in F, when {F, A) is chosen randomly from the set 

'^={{F,A):Fe'^t,AeARBS{F)} (21) 

Hence, by analysing the distributions of L'^^-' and D'^^\ we can estimate 

p[l(i) = d(i) = 01 

MAn] = AnK] ■ 
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Finally, wc consider random variables, L*^^^ and wliieh eount the number 

of loops and double arcs in F, when (F, A, A') is chosen randomly from the set 



$d = {(^^ A'):Fe^i, A, A' e ARBS{F)} (22) 
Hence, by analysing the distributions of L^^^ and D^'^\ we can estimate 

P[L(^)=J^W=0] 

1 - p[i = i5 = o] ^t^-^") J • 

We note that the probability that a random directed graph G ^ contains 
any fixed subgraph H with more arcs than vertices is negligible. To see this, 
suppose H has r vertices and r+s arcs, where r and s are fixed positive integers. 
The number of ways to choose a partial configuration projecting to H is 0{n'^). 
However, the probability of a particular set of r + s edges being contained in 
a uniformly random configuration F is l/{dn)r+s- Thus, the probability <t{F) 
contains a subgraph isomorphic to H is 0(n~*). Then, since the number of 
different graphs on r vertices with r + s arcs is independent of n, wc can assume 
that the contribution to K[{L)j{D)k] from tuples of loops and double arcs with 
repeated vertices goes to as n oo, i.e., for any pair of positive integers j 
and k, we have 

E[iL),{D)k]^E[LYE[Df . 

Hence, L and D converge to a pair of independent Poisson random variables. 
The same is true for the random variables L^^^ and D^^\ and also for L^^^ 
and Thus, it suffices to compute the means of these random variables to 

estimate the appropriate probabilities. 

We first compute the expectation of L and D. Suppose we have a loop edge 
e G Sv X m F and let 1^ be the indicator variable for the event e E F. Then, 
we can write L = ^y^y X^eeS„xT„ -^e and, by linearity of expectation, we have 

nL] = T. E nie] = Yl E ^1^^^]. (23) 

veV eeS^xT^ veV eeS^xT^ 

Given e, the number of ways to choose F with e € F is (m — 1)!, so the 
probability of a random F G c&jj containing e is 1/m. For each v, there are 
ways to choose an edge from Sy xTy. Hence, 

E[L] = lVd^ = !^. (24) 

V 

Next, we compute the expectation of D. Here, for every pair of edges e, / e 
Su X T^,, for some u ^ v, wc define an indicator variable If,j for the event 
e, f G F. By linearity of expectation, wc have 

^[^] = E E E neJ&F]. (25) 

uev vev\{u} ejes^xT^ 



13 




The probability of a particular pair of edges e and / occurring in a random 
configuration F G is, asymptotically, l/m?. Moreover, the number of ways 
to choose e, f ^ Su X Ty is 2(*') ('2")- Hence, the sum in (^5]) becomes 

uev vev\{u} \ ^ \ 
2 

(26) 

To finish the calculation we observe that the numerator of the negative term 
in ([26)1 is 0(m) (each du is bounded above by a constant d, so X^uC'^")! ^ d^m). 
Hence, this part of the sum disappears as m — > 00 and we sec that 

m^'-^. (27, 

Recall that L and D converge to independent Poisson random variables and, 
therefore, the probability that F is simple when F is chosen uniformly at random 
from $^ satisfies 

Next, we consider the distributions of L^^^ and D'^^\ We first estimate 
EfL^^)]. Suppose we have a loop edge e € x Ty, for some v £ V. A loop 
edge cannot be contained in any arborescence, and, thus, the number of pairs 
{F,A) e $^ where e G F, is equal to the number of pairs {F,A) G where 
d' is equal to d with dy replaced by d„ — 1. Hence, from Theorem [21 we can see 
that the number of elements of $^ with e e F is equal to 

nidy-l)Y[du{m-2y.. (29) 

Dividing (|29l) by the total number of elements in which we can also obtain 
from Theorem [21 gives the probability 

P[eGF|(F,^)G¥^].-^^^ (30) 

Evaluating p3)) with this probability in the place of P[e e F] gives 

E[i(i)] = ^ V d.(d. - 1) ^ ^^^^ . 
771 — 1 — ' m 

V 

Next, we evaluate E[_D(^^]. Suppose we have a pair of edges e, f £ Su x Ty 
for some u ^ v. By Lemma [1] the number of arborescences rooted at u in which 
each w ^ {u, v} has dw points available for its incoming and outgoing arcs, u 
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has du points available for incoming arcs, and v has dy — 2 points available for 
incoming arcs and d^ available for outgoing arcs is 

(m-3)„_2. (31) 

The expression in (I3ip counts the number of partial configurations which consist 
of the edges e and / along with n — 1 configuration edges that project to an 
arborescence rooted at u. There are {m — n — 1)! ways to extend each of these 
partial configurations to some S Hence, the following expression counts 
the number of pairs {F, A) S $^ with e, f £ F and A rooted at u. 

(^f[d^^im-3)\. (32) 

By the BEST Theorem (Theorem [T|), we know that each g $^ has the same 
number of arborescences rooted at each vertex, so (|32]) counts exactly 1/n of 
the pairs {F,A) G $^ with e,f £ F. Multiplying ^ by n and dividing by |$^| 
gives 

F[eJ eF\{F,A)e^]-^ ^. (33) 

This is the same probability as when F is chosen uniformly at random from 
so evaluating p7|) with (p3|) in place of P[e, / € F] does not change the 
(asymptotic) value and we have 

E[D(i)] ^ E[D] . 

Again, since L(i) and D'-^^ converge to independent Poisson random variables 
we have that the probability of F being simple in a random {F, A) £ $5i satisfies 

P,.....Z,«.0|.„p(--_^-^I^). (34) 

Together (|28p and (p4)) give the claimed estimate for E[^„]. 

Finally, we consider the distributions of i'^^-' and D'^^\ First, suppose we 
have a loop edge e £ Sv T^. The number of elements of $^ with e € is 
equal to the number of elements of where d' is the out-degree vector we 
used to compute E[i(^^]. By Theorem [21 we have 

\^\=^-^^7TW^^(I[d^ (-"2)!. 

Dividing by the number of elements in which know from Theorem [31 we see 
that 

F[e£F\{F,A,A')£^f,]-^^-^^-^. 
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Evaluating ([M)) with this probabihty in the place of P[e G F] gives 

E[L(2)] ^ "^^''"^ + " . (35) 



We now evaluate E[£'('^^]. Suppose we have a pair of edges e, f £ Su x Ty 
for some u ^ v. There are three cases to consider: e,f E AUA'; e,f ^ 
AUA'; or exactly one of e and / is in AU A' . We estimate E[Z3^^)] as follows. 
Using slightly more general arguments than those used to compute the second 
moment in Theorem [5J we count the number of triples {F,A,A') for each of 
these three cases, obtaining expressions which overcount in the same way as 
(|19p . Then, since the way in which triples are over-counted is the same in each 
of the three analyses, i.e., the number of times each triple {F,A,A') is counted 
is determined by the out-degrees of the roots of A and A' , we can add these 
three expressions together, apply the BEST theorem, and proceed as we did in 
the proof of Theorem [2l 

In each of the three cases, we want to count pairs of arborescences using 
some subset of the configuration points. Suppose we are working with sets of 
points where s„ — \Sy\ and ty — \Ty\ for each v, with Sy not necessarily equal 
to ty. Note that the fact that the in-degree and out-degree of each vertex v 
are equal is only used at the last step of the analysis of the second moment of 
.4* (in Theorem [2]). Thus, by following the arguments of the second part of 
Theorem [2] we find that, for each R QV, the expression over-counting triples 
{F,A,A') where ACiA' is a forest rooted at R (given by (fTH]) in the proof of 
Theorem [2]) becomes 

(em ( n ) (li'^ji^-^y-- (36) 

\weR / \weR / \wev ) 

The factor by which (l36l) over-counts (i^, ^, Al) is (s^ — l)(sr' — 1) if ^ and Al 
are rooted at different vertices r, r' S i?, and is Syi^Sy — 1) if both are rooted at 
the same vertex r € R. Summing p6p over all possibilities for R gives 

2 

V (37) 



Kwev ^ / \wev / 



Now, suppose e, f ^ AU A' ■ To enumerate the number of triples of this form 
we evaluate ([57)1 with Sw ~ for w ^ u, Su — du ~ 2, ty, ~ dy, for w ^ v, and 
ty = dy — 2, since we are removing two points from each of Sy and Ty. This 
gives 



dy dy 2 \ (dy 2 
m — n — ' 



|2 



2 



dy -2 dy J dl 
or, asymptotically, as m — n — > cxi, 

idu-2f 



\{dy,\ (m - 3)! 



\w£V 



(m - n) ^ "^/^ I n 1 (™ - 3)! . (38) 



\w£V 
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Next, suppose e,/ € AUA'. Since there can be at most one arc leaving u 
in A or A' it follows that we have an arc {u, v) in both A and A' . Hence, when 
we are choosing the pair of arborescences we must assume that (u, v) is always 
present. The corresponds to replacing u and u by a single vertex v' which has dy 
points available for outgoing arcs and du + dv — 2 points available for incoming 
arcs. That is, in this instance we work in a model where = tw = d^ for 
w ^ {u, v}, Syi = dy, and tyi = d^ + dy — 2. Evaluating (j37p with these values 
yields, asymptotically. 




(m-n)-— W d,y\ (m-3)!. (39) 



Given a pair of points from each of Sy and Ty, there are two ways to choose e 
and / and two ways to assign then to A and A' ■ Thus, the number of triples 
satisfying e,/ G A Li A' is, asymptotically. 




("^-")7T^ ("^-3)!. (40) 



Finally, suppose exactly one of e and / is in AuA'. This case is a little bit 
more complicated. Suppose e € A. We contract u and z; to a single vertex v' , 
as in the previous case, and choose a forest on V\{u,v} and v' with root R. 
Then, we proceed as in the proof of Theorem [21 except we consider u to be one 
of the roots of the components of T when choosing A' ; that is, when choosing 
A\A' we choose a tree on R, but when choosing A'\A we choose a tree on 
RU{u}, where the vertices in the component of our initial forest have now been 
divided between a component rooted at u and a component containing v. The 
final expression counting the number of ways to complete to ^ U .4' does not 
depend on how the vertices are distributed amongst the components of so it 
is safe to do this. In this way, we obtain the expression 



(d„-2)(^u ( n(-'^--i)) ( n 

\weR I \wen. I \wev\{u} 



sy, (m - 1)! (41) 



which over-counts the number of triples (F, ^,^') where i?U {u} are the roots 
of the components oiA^X, e e A, and f e F\{AUA'). 

Proceeding as before, by summing over all possibilities for R, gives 



, ,du-2 



n dyo] (m-1)!. 
wev / 



The cases where e G A' , f G A, and f E A' are all equivalent, so the expression 
over-counting triples {F,A,A') with exactly one of e and / in AuA' is 



, 4(d.-2) 



n dy}j (m-1)!. (42) 

wev / 
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Adding dSH]), (gHl), and dH]) gives 

(m-n) ( Jl J (m-3)!. (43) 

Finally, we observe that the triples are over-counted consistently in the three 
separate constructions given above. Hence, we can conclude, by the same rea- 
soning as was used in Theorem [5J that (|43l) over-counts the elements of <i>^ by 
a factor of 

(m-n-l)(m-n-2) 
Dividing by (|44l) and the number of elements in <I>^ gives 

This is the same probability for e, f d F when F is chosen uniformly at random 
from so we can conclude 

The random variables L^^-* and D^^-* converge to independent Poisson random 
variables. Hence, the probability that F is simple, when {F,A,A') is chosen 
uniformly at random from $5^, satisfies 

P[L(2) = 15(2) = 0] ^ exp ^ — ^ (45) 

\ m 2m^ / 

Combining ((28)) and (|45p gives the claimed estimate for E(y^„)^. □ 

Given the expectation and variance of the number of arborescences of a 
random G G f/jj, we can, from the BEST Theorem (Theorem [T]), deduce the 
expectation and variance of the number of Euler tours of a uniformly random 

G e Gt 

Corollary 1. Let d be some fixed constant, let d = (di, d2, ■ ■ ■) be a sequence 
of positive integers satisfying di < d for all i, let n G and let m — X]"=i '^f- 
Let Tn denote the number of Euler tours of a directed graph chosen randomly 
from Gn ■ Then, as m — n 00, 



nTn] ^ - 



f G [n] 

m 



m — n 
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We now consider our estimates for the first and second moment in the con- 
text of Chebyshev's inequality, which states that for a random variable X with 
expectation and variance <j{X), that for any fc > 0, the probability that 

X deviates by more than ka{X) from its mean is bounded as follows: 



\X~E{X)\>ka{X)] < -1 



For us X is Tn, hence 



E[r„] 
iE[r„] 
E[r„] 



-,1/2 



1-- + :t - ' 1 

m ^ n m 2 m 



1 

2 m(m — n) 

-„2 



(1 



- (1-^) 

2m{m — n) 3m 

1/2 



— ) 

1/2 



n 

3m 

1/2 



1/2 



3m — n 
3(m — n) 



Assume d = (X]ue[n] '^f)/"-' ^^'^ note we have 2 < d < d. Then we have 
a{Tn) IE[r„]^(^i^)i/2 > 2-1/2^-1. Therefore, if we consider any k > 

y/2d in Chebyshev's inequality, we will have ka{Tn) > E[Tn]. In this case, 
Chebyshev's inequality covers a range of values for Tn as small as 0, which is 
not helpful for our analysis. Hence the only values of k which can potentially 
help show Tn is not too small are k < \/2d. However, this forces k to be upper 
bounded by the constant \/2d, meaning that the probability of the favourable 
outcome can be no greater than 1 ~ "^'^i which does not approach 1. Hence 
Chebyshev's Inequality cannot give the desired results. 

In the next section, we will show how to use results of this section and the 
estimates of Corollary [I] to obtain an asymptotic distribution for the number of 
Euler tours of a random G G Gn''^, from which we can derive a concentration 
inequality. 

3 Asymptotic distribution of Euler tours 

To compute the asymptotic distribution we will use the following general theo- 
rem of Janson [6] (see also [3 Chapter 9]). 
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Theorem 4 (Janson [5]). Let Ai > and 5i > —1, i = 1,2,..., be constants 
and suppose that for each n there are random variables Xin, i — 1,2,..., and 
Yn ( defined on the same probability space ) such that Xin is non-negative inte- 
ger valued and E[Y„] ^ Q (at least for large n) and furthermore the following 
conditions are satisfied: 

1. Xin Xiao (in distribution) as n ^ oo, jointly for all i, where Xioo is a 
Poisson random variable with mean A^; 

2. For any finite sequence xi, . . .Xk of non-negative integers 

k 

^ [[il + S^y^e asn^oo; 



3. A,<52 < oo; 

E[Y(l 



^- 1^ ^exp(E^A,52). 



Then 

Yn 



w = i[ii + s,y 



Moreover, this and the convergence in ([T]) holds jointly. The infinite product 
definingW converges a.s. andinL2, withE[W] — 1 anrfE[iy^] — exp(^,- A^Jf ) — 
liin„_>oo E[y„]^/E[y„]^. Hence, the normalised variables are uniformly square 
integrable. Furthermore, the event W = equals, up to a set of probability 0, 
the event that Xioo > for some i with Si — —1. In particular, W > a.s. if 
and only if every 6i > —1. 

In our application of Theorem 2] we will have Yn — T„, the random variable 
counting Euler tours of d-in/d-out graphs, and Xin equal to the number of 
directed z-cycles in a random d-in/d-out graph. To apply Theorem 3] we need 
the following two lemmas. 

Lemma 2. For each positive integer i let Xin count the number of directed 
i-cycles in a directed graph obtained as the projection of a uniformly random 
F S The variables Xin are asymptotically independent Poisson random 

variables with means E[Xin] = Xi = i- . 

Proof. Recall that the probability of a uniformly random G E Gn containing any 
particular fixed subgraph H with more arcs than vertices is negligible. Hence, 
we can assume that cycles occur independently, i.e., for any sequence of non- 
negative integers ki, . . . ,ki, we have 

- e 1 ^ 

J=l J i=l 
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Hence, the random variables Xin, . . . , X^n converge to a set of independent 
Poisson random variables. Thus, all that remains is to estimate E[Xi„]. 

We say a set of i edges ei, 62, . . . , in a configuration is an i-cycle if there is 
a sequence of distinct vertices vi,V2, ■ ■ ■ ,Vi such that ej € S^^ x Ty.^^ for j < i 
and Ci S Sy. x Ty-^ . The probability of any particular i-cycle being contained in 
a random e $^ is 

{dn-iy. 1 
{dn)\ ^ {dny ' 

So, to estimate E[Xi„] all we need to do is count the number of different i- 
cycles that can occur in some F € <I>^ and then divide by {dny. Let / be some 
i-subset of [n\. There are (i — 1)! different ways to arrange / into an i-cycle 
{vi,V2, ■ ■ ■ ,Vi) and then c?^' ways to choose edges ej G Sy- x Ty-^-^ for 1 < j < i 
and Ci e Sy^ x Ty^ . Hence, 

and so E[Xi„] A^. □ 

Lemma 3. Let Xin be as in Lemma\3[ and let — Then, for any fixed 

set of integers ki,k2, ■ ■ ■ ,ki we have 



Proof. We only verify 



mi] 
nAix,„] 



mi] 

convergence of the factorial moments holds for the same reasons as were given 
in Lem ma 

Let ^i''' be the set defined in ([2T|) (for the case dy = d for all v) and let / be 
an i-subset of [n]. As in the previous lemma, there are (z — ways to choose 

a configuration for an z-cyclc on /. To estimate ^[A^ Xm] we need to calculate 
the probability that a particular i-cycle C is contained in F when {F, A) is 

chosen uniformly at random from Suppose C O A has c components. 

Pi, P2, • ■ • , Pc, each of which is a directed path, and let vj be the final vertex in 
the path Pj for 1 < j < c. Choosing the remainder of A is then equivalent to 
choosing an arborescence on {V\I) U {vj : 1 < j < c}, where we have collapsed 
each path to a single vertex. Each v E V\I has d points available for arcs 
directed towards or away from v. For each j = 1 . . . c, there are \Pj\{du — 1) 
points available for arcs directed towards Vj , and d — 1 points available for arcs 
directed away from Vj . Once we have chosen A, there are {dn — n — c— 1) ! ways 
to complete F. Hence, using Lemma [1] we can deduce that the number of ways 
to choose the remainder of {F, A) is 

n{d^iyd"'-'{dn-i-iy. 
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Summing over all the possible choices for P — {vi,V2, ■ ■ ■ ,Vc}, dividing by 
l^n'^l = and applying Stirling's formula yields 

□ 

Corollary 2. Let d > 2 be some fixed integer, and let Tn denote the number of 
Euler tours in a directed graph G chosen uniformly at random from G'^''^ ■ For 
any fixed set of integers fc2, . . . , fc^ we have 



E[r„] 



1=2 



We are now able to apply Janson's theorem to obtain an asymptotic distri- 
bution for the number of Euler tours of a uniformly random G € ^n''^- 

Theorem 5. Let d > 2 be some fixed integer, and let Tn denote the number of 
Euler tours in a directed graph G chosen uniformly at random from G'^''^ ■ Then, 



Zi 



E[r„] 

where the Zi are independent Poisson random variables with means d^ ji. 

Proof. It suffices to show that conditions ([ij to Q of Theorem |4] are satisfied 
by 7"n and : i > 2}, where Xi^ is the random variable counting z-cycles. 

Lemma [3 and Corollary [5] provide conditions ([1]) and ([2]) with 

1 

\i = — and Si ~ — -- . 

i d' 

With these values, evaluating the sum in condition ^ gives 



oo 

T — 



id"- d 

i=2 



^+l°g(T^)- (46) 



Finally, Corollary [T] provides condition ([4]). □ 



4 Generating and counting Euler tours 

We now turn to the analysis of Algorithm Sample in Section [TJ Note that 
although the algorithm is defined in terms of transition systems, it can also be 
considered as equivalent to a random directed walk on the Eulerian digraph 
or graph; terminating when we have used all outgoing edges of the starting 
vertex (whether we have created an Euler tour or not). This procedure was 
first considered in [8], where the authors considered it for undirected graphs 
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and showed that the expected number of runs needed to obtain an Euler tour 
is polynomial for the case of G = Kn for odd n. We consider the algorithm for 
G G Gn''^ and are interested in the quantitity 

\ETiG)\ 

(d!)" ^ ' 

The following theorem uses the results of the previous section (by a similar argu- 
ment to that used in [51 Lemma 1]) to show that this value is n(n~^) with high 
probability when G is chosen uniformly at random from G^''^- When this is the 
case, we can generate uniformly random Eulcr tours of G in expected polyno- 
mial time. Moreover, by setting the value of k in Approximate appropriately, 
we can approximate \ET{G)\. 

Theorem 6. Let d be some fixed integer, d > 2, and let G be chosen uniformly 
at random from Gn''^- Then, 



{diy 



1 



as n oo. 



Proof. Wc first note that by the estimate for E[T„] given in Corollary (TJ the 
statement above is equivalent to showing that 

P [Tn > n-^E[Tn]] ^ 1 . 

For X = (^2, . . . , Xfc) we define Gx to be the set of all d-m/d-out graphs containing 
exactly Xi directed cycles of length i for each i = 2 . . .k, and 



For each fixed 7 > we define 

S{-f) = {x:xi< X, + jXi for 2 < i < fc} . 

From Lemma [2] (and Lemma 3 of [5]), we can deduce that the probability 
of a random d-in/d-out graph G not being contained in Gx for some x € S{'y) 
is 0{e~°'^), where a is an absolute constant independent of 7. Hence, to verify 
the theorem all we need do is show that 

W{x.) > e-(''+'=)'' Vx e 5(7) , (48) 

where b and c are absolute constants independent of 7. For any particular b, c 
and 7, we can choose n sufficiently large so that e"'''^'^'*'' > n^^. Then, if 
holds, we have 



[Tn > n-^E[Tn]\ > 1 



e 



-a7 
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The above holds for any constant 7, and so can be taken as equal to 1 in the 
limiting case. 

So, it remains to prove ([^H)). For x G 5'(7) we have VF(x) = AB'' , where 

i>2 ^ ^ 
i>2 ^ ^ 

We can bound the right hand side of (HH]) as 

^ ^ fiexP (I - = exp (E--(^) ■ 

The sum inside the exponential is clearly convergent, so we can conclude that 
A > e^^ for some absolute constant b. Similarly, we can bound B by 

and, again, the sum in the exponential is convergent, so B'^ > e^'^'' for some 
absolute constant c. □ 
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